Pipe: Utilize parallelStream for concurrent execution of create, start, stop, and drop pipe tasks to enhance performance #11892

Merged
SteveYurongSu merged 1 commit into apache:master from VGalaxies:pipe-parallel-stream on Jan 14, 2024

@VGalaxies
Contributor

Description

Currently, when a pipe is created and the cluster already holds historical data, the pipe is started automatically. Starting it extracts the historical data of each data region serially. With large data volumes this can time out, causing pipe creation to fail. To address this, this PR parallelizes these operations using parallelStream (inspired by @SteveYurongSu).
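
As a rough illustration of the idea (not the code merged in this PR), the per-region work can be fanned out with `parallelStream` roughly as follows. The class `PipeTaskParallelSketch`, the interface `RegionTask`, and the method `runOnAllRegions` are hypothetical names made up for this sketch; they are not IoTDB classes.

```java
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

/** Illustrative only: every name in this class is hypothetical, not an IoTDB API. */
final class PipeTaskParallelSketch {

  /** Hypothetical per-region operation (create, start, stop, or drop a pipe task). */
  @FunctionalInterface
  interface RegionTask {
    void run(int dataRegionId) throws Exception;
  }

  /**
   * Runs the task once per data region via parallelStream and returns the
   * failures keyed by region id, so one failing region does not hide the others.
   */
  static Map<Integer, Exception> runOnAllRegions(List<Integer> dataRegionIds, RegionTask task) {
    // ConcurrentHashMap because the lambda below may run on several threads at once.
    Map<Integer, Exception> failures = new ConcurrentHashMap<>();
    // forEach is a terminal operation: it returns only after every region was processed.
    dataRegionIds.parallelStream().forEach(regionId -> {
      try {
        task.run(regionId);
      } catch (Exception e) {
        failures.put(regionId, e);
      }
    });
    return failures;
  }
}
```

A caller would pass the region ids and the operation, for example `runOnAllRegions(regionIds, id -> startPipeTask(id))` with a hypothetical `startPipeTask`, and treat a non-empty result as a failed step. Note that parallel streams run on the common ForkJoinPool by default, whose size is derived from the number of available processors, so long blocking tasks are only parallelized up to that limit.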


This PR has:

  • been self-reviewed.
    • concurrent read
    • concurrent write
    • concurrent read and write
  • added documentation for new or modified features or behaviors.
  • added Javadocs for most classes and all non-trivial methods.
  • added or updated version, license, or notice information
  • added comments explaining the "why" and the intent of the code wherever it would not be obvious
    for an unfamiliar reader.
  • added unit tests or modified existing tests to cover new code paths, ensuring the threshold
    for code coverage.
  • added integration tests.
  • been tested in a test IoTDB cluster.

Key changed/added classes (or packages if there are too many classes) in this PR

@SteveYurongSu SteveYurongSu self-assigned this Jan 14, 2024
Member

@SteveYurongSu SteveYurongSu left a comment

:)

@SteveYurongSu SteveYurongSu merged commit aeeb0ba into apache:master Jan 14, 2024
@VGalaxies VGalaxies deleted the pipe-parallel-stream branch January 18, 2024 08:55
SzyWilliam pushed a commit to SzyWilliam/iotdb that referenced this pull request Nov 26, 2024
…t, stop, and drop pipe tasks to enhance performance (apache#11892)
